Two step speaker segmentation method using Bayesian information criterion and adapted Gaussian mixtures models

Authors

  • Matej Grasic
  • Marko Kos
  • Andrej Zgank
  • Zdravko Kacic
Abstract

This paper addresses online unsupervised speaker segmentation in a complex audio environment such as that found in Broadcast News databases. A new two-stage speaker change detection algorithm is proposed, which combines the Bayesian Information Criterion (BIC) with an ABLS-SCD statistical framework in which adapted Gaussian mixture models are used to achieve higher accuracy. To enhance the performance of the proposed method, a sub-window-dependent threshold selection strategy for the ABLS-SCD is introduced, and an additional window selection strategy is presented. Experimental design and test evaluation were carried out on the Slovenian BNSI Broadcast News database.
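
The abstract does not spell out how the BIC stage is computed, so the following is only a minimal sketch of the widely used delta-BIC test for a single candidate change point, assuming each segment is modelled by one full-covariance Gaussian. It is not the authors' two-stage method and does not cover the ABLS-SCD stage. The function names (`log_det_cov`, `delta_bic`, `best_change_point`) and the parameters `penalty_weight` and `margin` are hypothetical, chosen for illustration.

```python
import numpy as np


def log_det_cov(frames: np.ndarray) -> float:
    """Log-determinant of the maximum-likelihood covariance of frames (n x d)."""
    cov = np.atleast_2d(np.cov(frames, rowvar=False, bias=True))
    _, logdet = np.linalg.slogdet(cov)
    return float(logdet)


def delta_bic(frames: np.ndarray, split: int, penalty_weight: float = 1.0) -> float:
    """Delta-BIC for a hypothesised change after frame index `split`.

    Positive values favour two separate Gaussians, i.e. a change point.
    """
    n, d = frames.shape
    left, right = frames[:split], frames[split:]
    # Log-likelihood gain from modelling the two halves separately.
    gain = 0.5 * (n * log_det_cov(frames)
                  - len(left) * log_det_cov(left)
                  - len(right) * log_det_cov(right))
    # Extra free parameters of the second Gaussian: mean plus symmetric covariance.
    n_params = d + d * (d + 1) / 2
    return gain - penalty_weight * 0.5 * n_params * np.log(n)


def best_change_point(frames: np.ndarray, margin: int = 20,
                      penalty_weight: float = 1.0):
    """Return (index, score) of the highest positive delta-BIC, or None."""
    scores = [(delta_bic(frames, t, penalty_weight), t)
              for t in range(margin, len(frames) - margin)]
    if not scores:
        return None
    score, t = max(scores)
    return (t, score) if score > 0 else None
```

Here `frames` would be a window of acoustic feature vectors (for example MFCCs), one row per frame. The `margin` keeps both halves large enough for a stable covariance estimate, and `penalty_weight` plays the role of the tunable penalty factor that BIC-based detectors typically expose.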

Similar resources

Speaker diarization using normalized cross likelihood ratio

In this paper, we present the Normalized Cross Likelihood Ratio (NCLR) and the advantages of using it in a speaker diarization system. First, the NCLR is used as a dissimilarity measure between two Gaussian speaker models in the speaker change detection step and its contribution to the performance of speaker change detection is compared with those of BIC and Hotelling's T-statistic measures. T...

Speaker diarization using bottom-up clustering based on a parameter-derived distance between adapted GMMs

In this paper, we present an approach for speaker diarization based on segmentation followed by bottom-up clustering, where clusters are modeled using adapted Gaussian mixture models. We propose a novel inter-cluster distance in the model parameter space which is easily computable and which can both be used as the dissimilarity measure in the clustering scheme and as a stop criterion. Using ada...

A system for the segmentation and transcription of Italian Audio News

This paper presents the development of an Italian broadcast news transcription system, to be applied for the indexing of multimedia archives. Moreover, a broadcast news corpus under collection at ITC-irst is introduced. The system processes the input audio stream in four stages. The first one performs audio segmentation via the Bayesian Information Criterion (BIC) and classification by Gaussian...

A System for the Segmentation and Transcription of Italian Radio News

This paper presents the development of an Italian broadcast news transcription system, to be applied for the indexing of multimedia archives. Moreover, a broadcast news corpus under collection at ITC-irst is introduced. The system processes the input audio stream in four stages. The first one performs audio segmentation via the Bayesian Information Criterion (BIC) and classification by Gaussian...

Unsupervised speaker segmentation of broadcast news using MDL-based Gaussian model

This paper proposes an approach for unsupervised speaker segmentation and gender discrimination of broadcast news. In this paradigm, a speaker segmentation mechanism using an MDL-based Gaussian model is first adopted to determine the speaker changes using the mean and covariance of the Gaussian model. These speaker segments partitioned by speaker changes are smoothed and discriminated into male or f...

Publication date: 2008